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Abstract 

This paper presents a procedure for estimation of the Kalman filter gain from output residuals. 
The system state space model is assumed to be known, but the process and noise covariances are 
unknown. The proposed procedure consists of three basic steps. First, the output residuals are 
computed from the given model and a given set of input-output data. Second, a linear regression 
model for this part of the response is computed by a least-squares solution. Third, the Kalman 
filter gain is then estimated from the coefficients of this model. Numerical results using 
experimental data are presented to illustrate the validity of the developed procedure. 


Introduction 


A state space model of a linear system describes the system input and output via a quantity 
called the state vector. There is a class of controllers that uses the state information to compute the 
control input. However, the state vector itself is typically not accessible for direct measurement. A 
state estimator, also known as an observer, can be used to provide an estimate of the system state 
from input and output measurements. 

In the presence of process and measurement noises, under ideal conditions, an optimal 
observer is the Kalman filter. The Kalman filtering problem has been studied for several decades. 
To compute the Kalman filter gain, the system model must be known, and the individual process 
and measurement noise covariances must also be known. In practice, these are somewhat 
restricted requirements, since neither the system nor the noise characteristics can be known exactly. 
Nevertheless, a mathematical model of the system can be derived analytically, or experimentally 
from input-output measurement data by a system identification method. An estimate of the 
measurement noise covariance may be obtained by examining the response of the sensor devices. 
The process noise covariances, however, is almost impossible to be obtained by direct 
measurement and some guess work is required. 
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It is thus difficult to determine accurately the individual process and measurement noises 
characteristics. But, collectively, their information is present in the system input-output data. It is 
the purpose of this paper to formulate a procedure to estimate the Kalman filter gain directly from 
knowledge of the system and input-output data without the need to identify the process and 
measurement noise covariances individually. The problem of identifying both the system and the 
Kalman filter gain is studied in Refs. 1-5. In practice, due to the presence of other factors such as 
disturbances, non-linearities, non-whiteness of the process and measurement noises, etc... the 
resulting identified filter is not the Kalman filter. In such a case, the identified filter is simply an 
observer that is computed from input-output data that minimizes the filter residual in a least-squares 
sense. 

The outline of this paper is the following. First, a mathematical statement of the problem is 
presented. The main section that follows formulates a least-squares solution for computing the 
coefficients of the model that describes the stochastic portion of the response. The Kalman filter 
gain is then estimated from these coefficients. The derivation is done in the time domain. For 
better understanding of the formulation, an interpretation in the z-domain is also provided. 
Numerical examples are given to illustrate the method proposed in this paper. 


Statement of the Problem 

Consider a linear discrete multivariable system in state space format 

x{k + 1) = Ax(k) + Bu(k ) + w,(A) 

y(k) = Cx(k) + w 2 (k) 


( 1 ) 


The process noise w t (k) and measurement noise w 2 (k) are two statistically independent, zero- 
mean, stationary white noise processes. It is known from Kalman filter theory that there exists an 
optimal filter for the above system of the form 


x(k + 1) = Ax(k) + Bu(k) + Ke(k) 
y(k) = Cx(k) + Du(k) 


( 2 ) 


where £ (k) is a white sequence of the Kalman filter residual, 

£(k) = y(k)-y(k) 

The filter gain A' is a function of the system parameters and the process and measurement noise 
covariances. Suppose that a set of input-output data of the system given in Eq. (1) is available in U 
and y as 


u = [w(0) u( 1) ••• 

y=b«>) y( i) - 

The objective of the problem is to estimate the Kalman filter gain K from a given set of data. The 
system matrices A, B.C.D are assumed to be known. The statistical properties of the noises in the 
system, however, are assumed not known. 


u(N)] 

y(N)] 


( 3 ) 
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Mathematical Formulation 


For simplicity of presentation, this section is divided into various sub-sections, each will deal 
with a specific aspect of the formulation. The problem is formulated in the following basic steps. 
First, since the system parameters are assumed known, it is possible to compute the deterministic 
and stochastic portions of the response for the given set of input data. This step will simplify the 
subsequent mathematics. Next, a least-squares solution for computing the coefficients of the 
model that describes the stochastic portion of the response is formulated. Finally, the Kalman filter 
gain is estimated from these coefficients. 

1. Deterministic and Stochastic Portions of the Response 

First, the mathematical problem defined in the previous section can be simplified considerably 
by observing that it is possible to partition the Kalman filter model into two parts as 

x l (k + \) = Ax i (k) + Bu(k) 

(4) 

y x (k) = CxAk) + Du(k) K 9 

and 

x 2 (k + l)= Ax 2 (k) + Ke(k) 

y 2 (k) = Cx 2 (k) + e(k) (5) 

where x(k) = x ] (k) + x 2 (k) and y(k) = y ] (k) + y 2 (k). The quantities jc, (A) and y,(fc) are portions 
of the state and output, respectively, caused by the known deterministic input u(k). The quantities 
x 2 (k) and y 2 (k) are portions of the state and output caused by the unknown stochastic input £(k). 

Assuming zero initial conditions, *,(()) = 0, for the given set of input-output sequence, the 
deterministic and stochastic portions of the output can be easily computed. To see this, note that 
y,(*)can be easily found from the known values of A, B, C, D and u(k) as 

it 

y l (k)='£CA i ~'Bu(k-i) + Du(k) (6) 

*=1 

The parameters Y(i) = CA‘~'B in the above expression are known as the Markov parameters of the 
system. Knowing y,(&), the stochastic portion of the output, y 2 (k), can be computed from 

yi(k) = y(k)- yi (k) (7) 

Hence, the mathematical problem reduces to that of finding K from Eq. (5), provided that A, C, and 
y 2 (k) are known. In this paper, the stochastic portion of the output, y 2 (k), is also referred to as 
output residuals. 

2. Computation of a Model for the Stochastic Portion of the Response 

A procedure to compute a model for the stochastic portion of the response given in Eq. (5) is 
presented in this section. Equation (5) can also be re-written as 
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( 8 ) 


x 2 (k + 1) = (A - KC)x 2 (k) + Ky 2 (k) 

y 2 (k) = Cx 2 (k) + e(k) 


Assuming zero initial conditions, i 2 (0) = 0, from Eq. (8), 

y 2 (k) = j^C(A - KC) iX Ky 2 (k-i ) + e(k) (9) 

i=i 

For simplicity of notation, define the parameters 

a* = C(A - KC) k ] K , k = 1, 2, p (10) 

The matrix A - KC is the Kalman filter system matrix, which is known to be asymptotically stable. 
Hence, for some sufficiently large value of p, 

(A-KCf'.O, i>p (11) 

Equation (9) can then be approximated by a finite set of parameters a,, a 2 , ..., a p as 

yi(k) = 2^ a,y 2 (k - i) + e(k) (12) 

i=i 

Writing Eq. (12) in matrix form for the given set of data 

y = «Y + e (13) 


where 


y = [y(0) y(l) — y(p) y(p + 1) — y(/V)] 

a=[a, a 2 a„] 

>z(0) y 2 (l) ••• y 2 (p) y 2 (p + 1) — y 2 (N) 

y 2 (0) - y 2 (p- 1) y 2 (p- 2) - y 2 (N- 1) 

• ♦ • • t 

* • • • t 

> 2 (0) y 2 (i) - yi(N-p) 

e = [£(0) £(1) ••• e(p) £(p + 1) ... £(A/)] 


The residual time history of the Kalman filter is known to be minimized and orthogonal to the data 
sequence. To obtain an estimation of the Kalman filter with a finite data record, we compute a by 
the least-squares solution. 


a = yY + 


(14) 
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where the superscript + denotes the pseudo- inverse. The parameter matrix a contains the 
coefficients for a model of the stochastic portion of the response from which the Kalman filter gain 
K can be estimated. This is shown in the following section. 

3. Estimation of the Kalman Filter Cain 

Finally, the filter gain K can be computed from the above least-squares solution for a by the 
following procedure. First define the Kalman filter gain Markov parameters 

p k =CA k ~'K , * = 1, 2, .... p (15) 

Making use of the definitions of a k and A given in Eq. (10) and Eq. (15), the parameters 
A , A ..... A can be computed from a, , a 2 ,..., a p by the recursive equation 

A =«* + XA-,«, » k = \,2,...,p ( 16 ) 

i=1 

Since A and C are known, the gain matrix K can now be solved from 

K = (V f V) ' V t Yk ( 17) 

where V is an observability matrix from A and C, and Y* is a matrix formed by the parameters 
A = CA kl K, k = 1, 2, ..., p computed in Eq. (16). 



■ c ' 


CK 


"A" 

v = 

CA 


CAK 

= 

A 


CA P ~ 1 


1 

2 

i 

* 

i 


A. 


Z-Domain Interpretation 

This section provides an explanation of the above time-domain derivation by examining the 
equations in the z-domain. First, it is of interest to examine various input-output description in the 
z-domain. The results will then be specialized for the problem considered in this paper. 

1. Input-Output Models in the Z-Domain 

Assuming that the system in Eq. (2) is asymptotically stable, the system input-output 
relationship can be written in terms of the Kalman filter as 

y(k) = £ CA-'Bu(k - i) + Du(k) + £ CA^Ke{k - i) + e(k) (18) 

i=l *=1 

where the order of the linear difference equation, q, is sufficiently large such that A k is negligibly 
small for k > q. Taking the z-transforms of both sides of Eq. (18) yields 
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Y(z) = M(z)U(z) + N(z)E(z) 


(19) 


where the transfer functions M(z) and N(z) are given as 

M(z) = £OV _1 flz~‘ + D , N(z) = ]£ CA‘- l Kz-‘ + 1 

i = 1 i=l 

Note that the coefficients in M(z) are the Markov parameters of the system, and the coefficients in 
N(z) are the Kalman filter gain Markov parameters discussed previously. Together they contain 
information about the system and the Kalman filter. The z-transfer functions of the deterministic 
and stochastic parts of the response are Y,(z) and Y 2 (z), respectively 

Y,(z) = M(z)U(z) , Y 2 (z) = N(z)E(z) (20) 

Y(z) = Y,(z) + Y 2 (z) 

On the other hand, Eq. (2) can also be re-written as 

x(k + l) = (A - KC)x(k) + ( B - KD)u(k) + Ky(k) 

( 21 ) 

y(k) = Cx(k)+Du(k) + e(k) y ’ 

which yields the following input-output relationship 

y{k) - £ C(A - KC)‘~' Ky(k - /) = Du(k) + ^C(A - KCf'(B - KD)u(k - i) + e(k) (22) 

1=1 1=1 

where the order of the linear difference equation, r, is sufficiently large such that (/4 -KC) k is 
negligibly small for k > r . Taking the z-transforms of both sides of Eq. (22) yields 

P(z)Y(z) = Q(z)U(z) + E(z) (23) 


where 


P(z) = / - £C(A - Key ' Kz ' = / - 

i* I i=t 

Q(z) = D+^C{A - KC)‘~\B - KD)z i 

i=i 

Comparing Eq. (19) to Eq. (23) suggests that 

M(z) = P(z)-'Q(z) , N(z) = P(z)-' (24) 

A proof of the above relation is provided in the appendix. The significance of the above relations 
is that if the coefficients of P(z) and Q(z) are known, then M(z) and N(z), which characterize the 
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system and the Kalman filter, can be computed. In fact, knowledge of the coefficients of M(z) and 
N(z) is sufficient to compute a state space model of the system (A, B, C, D) and the Kalman filter 
gain K. The problem of identification of the coefficients of P(z) and Q(z) for the purpose of 
identifying a state space model and the Kalman filter gain is treated in Refs. 1-3. 

2. A Model for the Stochastic Portion of the Response in the Z-Domain 

In the present case, since the system model is assumed to be known, the problem is 
considerably simplified. The coefficients of P(z) can be determined directly from A (z) which is 
the linear difference model for the stochastic part of the output is given in Eq. (8). 

A(z)Y 2 (z) = E(z) (25) 


A(z) = / - £c( A - KC)'-' K z" = / - £a, z" 

«=! i=l 

Compare A (z) with P(z) in Eq. (23) immediately yields A(z) = P(z), hence 

N(z) = P(zr'=A(z)-' (26) 

Equation (26) states that N(z), from which the Kalman filter gain K can be computed, can be 
obtained by inverting the transfer function A (z) or P(z). Instead of direct inversion of the 
polynomial A(z) , the coefficients of N(z) can be computed from the relation A(z)N(z) = I , which 
is simply 


(/ - a t z~' - a 2 z ~ 2 - •••)(/+ A z " 1 +P 2 z~ 2 ~ — ) =/ 

Equating like powers of z“‘, i = 1, 2, ... and solving for the coefficients /J., / = 1, 2, ... 
immediately yields the same result as given in Eq. (16), e.g., 

A = « i 

& = a 2 + a,j3, 

P 2 = + OC\P 2 


Experimental Results 


The developed procedure is applied to experimental data of a truss structure at NASA Langley 
Research Center partially shown in Fig. 1 . The L-shaped structure consists of nine bays on its 
vertical section, and one bay on its horizontal section, extending 90 inches and 20 inches, 
respectively. The shorter section is clamped to a steel plate which is rigidly attached to the wall. 
The system has two air jet thruster inputs, and two accelerometer outputs. For identification, the 
structure is excited using random inputs to both thrusters. The input-output signals are sampled at 
250 Hz. A data record of 2,000 points is used for identification. 
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First, a 26-th order state space model of the system and its associated observer/filter gain are 
identified by the Observer/Kalman Filter Identification algorithm (OKID) 3 with an assumed order 
of 40. Since the retained system and filter model is only of 26-th order, the resultant residual of 
the reduced order filter is not necessarily minimized or white. This is due to the singular value 
truncation step during realization to obtain a reduced system and filter model of 26-th order. The 
procedure developed in this paper is then used to identify an improved filter gain, keeping the 
OKID-identified system model the same. 



ORIGINAL PAGE !S 
OF POOR QUALITY 


To verify that the new observer gain is in fact an improved one, the auto-correlation of the 
filter residuals at different time shifts for the first and second outputs are shown in Figs. 2 and 3, 
respectively. First, when compared with the original residuals, the norms of the filter residuals are 
reduced with the improved filter gain. This is indicated by the value of the auto-correlation 
corresponding to the number of time shift being equal to zero. Second, observe that in both cases, 
there is an general improvement in the whiteness of the filter residual with the improved gain, 
where the auto-correlation function is used as a whiteness measurement. 

It is important to view the results in light of the limitations in the application of the method. 
First, only for a linear system with white process and measurement noises, the Kalman filter 
residual is minimized and white. In practice, however, this is not the case as any deviations from 
the ideal assumptions will reflect in the identification results. Second, the computation procedure 
requires that the true system model be known. This is a rather ideal assumption that is not satisfied 
in practice. Third, because the Kalman filter describes a stochastic process, even when the 
conditions outlined above are met, one can only identify the true Kalman filter gain with an infinite 
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sci of data samples. In the example shown, the system considered here is an actual truss structure, 
the data in the computation is actual test data, the state space model used is an experimentally 
identified one, and the data record is only 2000 samples long. The example illustrates the use of 
the method in providing an improved estimation of the Kalman filter gain from experimental data, 
assuming that an experimentally identified state space model is in fact a valid representation of the 
system. 



Figure 2: Auto-correlation of filter residual for the first output using 
original and improved filter gains. 



Figure 3: Auto-correlation of filter residual for the second output using 
original and improved filter gains. 


The identified filter can be used to describe the system input-output map. An overlay of the 
actual responses of the test structure, and the estimated responses provided by the filter is shown in 
Figs. 4 and 5 for each of the two outputs, respectively. The solid curves represent the actual 
measured responses of the truss structure whereas dashed curves represent the estimated 
responses. The two curves in each figure practically overlap. This reveals that the identified 
observer adequately describes the input-output relationship of the system. In fact, the difference 
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between the actual response and the estimated response is precisely the filter residual which is 
discussed earlier in Figs. 2 and 3 for each of the two outputs, respectively. 



Time (sec.) 


Figure 4: Actual and reconstructed response using identified filter for the first output. 



Figure 5: Actual and reconstructed response using identified filter for the second output. 


Concluding Remarks 

This paper presents a procedure to estimate the Kalman filter gain from input-output 
measurement data, assuming that the system parameters are known. The procedure consists of 
three basic steps. First, the stochastic portion of the response is computed. Second, the 
coefficients of a linear difference model for this portion of the response are estimated by a least 
squares solution that minimizes the filter residual. Third, the Kalman filter gain is computed from 
these model coefficients. In practice, due to the presence of other factors such as non-linearities, 
unmodelled dynamics, etc... the estimated filter gain is not necessarily the Kalman filter gain. 
Nevertheless, the identified filter gain is optimal in that the resultant filter residual is minimized in 
the least-squares sense for the given system and the given set of input-output data. 





Appendix 


Statement of Proof: 

Given the following polynomials 

P(z) = / - ^CA‘~ l Kz'‘ , Q(z) = D + '£CA , '(B-KD)z- i , A=A-KC 

;=i <=■ 

(A.l) 

M(z) = ^CA'-'Bz 1 + D , N(z) = £ CA‘ l Kz H + / 

i= 1 i=l 

where r and q are sufficiently large such that the truncation error is negligible, 

A =(A~ KC)‘ ® 0 , / > r , A-0, />? (A.2) 

It will be shown in the following that 

M(z) = P(z)~‘Q(z) (A. 3) 

N(z) = P(z ) 1 (A.4) 

Proof: 

First, for a stable system matrix A 

jTor'/fc"' = C(/z _1 + Az " 2 + A 2 z -3 + -)B 

» = 1 

= C(/z - A ) -1 fi 

Similarly, 

Y'CA'-'Kz- 1 =C(lz-A)'K 

i = 1 

^CA-'Kz - 1 =C(lz- A)~' K 

1 = 1 

CA ' (B - KD)z~ i = C(/z - A) (B - KD) 

1 = 1 

Relation (A.4) can be shown first by making use of the initial assumption regarding the stability of 
A and A in (A.2), the matrix inversion lemma, and the above equations. 


(A.5) 


P(zY' = l-^CA-'Kz^ = [/ - C7(/z - >4 AT]’' 


= I +C(lz-A~KCf' K 
= I+C(lz-AY'K 


= / + ]T G4' 'Kz~' = N(z) 

4 = 1 


To show the relation (A.3), consider the product P(z) 'Q(z), 

P(z)-'Q(z) = [/ + C(/z - A)' ' k|d + C(lz - A)-\B - AD)] 

Hence, 

P(z)- 1 Q (z)= D + C(Iz-A)~'(B- KD) + C(/z - A)' KD + C(lz - A)~' KC(lz - A )~' ( B - KD) 

= D + C(lz - Ay' KD + C[l + (/z - >1)"' KC](lz - Af'(B - KD) 

= D + C(lz - A)' 1 KD + C(/z - /4) _l (/z - A + KC^Iz - A)~' {B - KD) 

= D + C(/z - A)'' KD + C(/z - A)~' ( B - KD) 

= D + C(lz-Ay'B = M(z) (A. 6) 

Equation (A.3) is thus proved. 
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